Towards Scene Understanding: Deep and Layered Recognition and Heuristic Parsing of Objects
نویسندگان
چکیده
The above researches were supported by The National Basic Research Program of China under Grant Nos. 2006CB708303 (advanced research) and 2007CB311005, The National High-Tech Research and Development Plan of China under Grant Nos. 2006AA01Z192 and 2005AA147060, and The National Natural Science Foundation of China under Grant No. 90820017. I Title: Towards Scene Understanding: Deep and Layered Recognition and Heuristic Parsing of Objects Speciality: Control Science and Engineering Applicant: Yang Wu Supervisor: Prof. Nanning Zheng
منابع مشابه
Scene parsing using inference Embedded Deep Networks
Effective features and graphical model are two key points for realizing high performance scene parsing. Recently, Convolutional Neural Networks (CNNs) have shown great ability of learning features and attained remarkable performance. However, most researches use CNNs and graphical model separately, and do not exploit full advantages of both methods. In order to achieve better performance, this ...
متن کاملLearning Scene Gist with Convolutional Neural Networks to Improve Object Recognition
Advancements in convolutional neural networks (CNNs) have made significant strides toward achieving high performance levels on multiple object recognition tasks. While some approaches utilize information from the entire scene to propose regions of interest, the task of interpreting a particular region or object is still performed independently of other objects and features in the image. Here we...
متن کاملHierarchical Feature For Scene Parsing Using Fully Recurrent Network
In scene parsing, the wide-range contextual information is not effectively encoded. Scene parsing provides segmentation and determines an scene into different regions associated with semantic categories. The main objective of scene parsing is to reduce semantic gap between humans and computer machines on scene understanding. The scenes parsing applications are object detection, text detection o...
متن کاملBasic level scene understanding: from labels to structure and beyond Citation
An early goal of computer vision was to build a system that could automatically understand a 3D scene just by looking. This requires not only the ability to extract 3D information from image information alone, but also to handle the large variety of different environments that comprise our visual world. This paper summarizes our recent efforts toward these goals. First, we describe the SUN data...
متن کاملIntegrating Function, Geometry, Appearance for Scene Parsing
In this paper, we present a Stochastic Scene Grammar (SSG) for parsing 2D indoor images into 3D scene layouts. Our grammar model integrates object functionality, 3D object geometry, and their 2D image appearance in a Function-Geometry-Appearance (FGA) hierarchy. In contrast to the prevailing approach in the literature which recognizes scenes and detects objects through appearance-based classifi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010